Abstract - This paper presents prefix codes which minimize
various criteria constructed as a convex combination of maximum
codeword length and average codeword length, or maximum
redundancy and average redundancy, including a convex combination
of the average of an exponential function of the codeword
length and the average redundancy. This framework encompasses
as special cases several criteria previously investigated in the
literature, and relations to universal coding are discussed. The
coding algorithm derived is parametric, re-adjusting the initial
source probabilities via a weighted probability vector according
to a merging rule. The level of desirable merging has implications
in applications where the maximum codeword length is bounded.

I. Introduction 

Lossless fixed to variable length source codes are usually
examined under known source probability distributions, and
unknown source probability distributions. For known source
probability distributions there is an extensive literature which
aims at minimizing various pay-offs such as the average
codeword length [1], the average redundancy of the codeword
length [2], [3], the average of an exponential function of
the codeword length [4]-[6], and the average of an exponential
function of the redundancy of the codeword length [6], [7].
On the other hand, universal coding and universal modeling,
and the so-called Minimum Description Length (MDL)
principle are often examined via minimax techniques, when
the source probability distribution is unknown, but belongs to
a pre-specified class of source distributions [2], [8]-[11].

This paper is concerned with lossless coding problems, in 
which the pay-offs are the following. 1) A convex combination 
of the maximum codeword length and the average codeword 
length, or a convex combination of the maximum pointwise 
redundancy and the average pointwise redundancy of the code- 
word length, and 2) a convex combination of the average of an 
exponential function of the codeword length and the average 
codeword length, or a convex combination of the average of 
an exponential function of the pointwise redundancy and the 
average redundancy of the codeword length. 

These are multiobjective pay-offs whose solution bridges
together an anthology of source coding problems with different
pay-offs, including some of the ones investigated in the above
mentioned references. Moreover, for 1) there is a parameter
$\alpha \in [0,1]$ which weights the maximum codeword length
(resp. maximum pointwise redundancy of the codeword) while
$(1-\alpha)$ weights the average codeword length (resp. average
redundancy of the codeword), and as this parameter moves
away from $\alpha = 0$ the maximum length of the code is reduced,
resulting in a more balanced code tree. A similar conclusion
holds for 2) as well.

A. Objectives and Related Problems 

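Consider a source with alphabet
$\mathcal{X} \triangleq \{x_1, x_2, \ldots, x_{|\mathcal{X}|}\}$
of cardinality $|\mathcal{X}|$, generating symbols according to the
probability distribution
$\mathbf{p} \triangleq \{p(x) : x \in \mathcal{X}\} \equiv (p(x_1), p(x_2), \ldots, p(x_{|\mathcal{X}|}))$.
Source symbols are encoded into $D$-ary codewords. A code
$\mathcal{C} \triangleq \{c(x) : x \in \mathcal{X}\}$ for symbols in
$\mathcal{X}$ with image alphabet $\mathcal{D} \triangleq \{0, 1, 2, \ldots, D-1\}$
is an injective map $c : \mathcal{X} \to \mathcal{D}^*$, where
$\mathcal{D}^*$ is the set of finite sequences drawn from $\mathcal{D}$.
For $x \in \mathcal{X}$ each codeword $c(x) \in \mathcal{D}^*$,
$c(x) \in \mathcal{C}$, is identified with a codeword length
$l(x) \in \mathbb{Z}_+$, where $\mathbb{Z}_+$ is the set of non-negative
integers. Thus, a code $\mathcal{C}$ for source symbols from the alphabet
$\mathcal{X}$ is associated with the length function of the code
$l : \mathcal{X} \to \mathbb{Z}_+$, and a code defines a codeword length
vector
$\mathbf{l} \triangleq \{l(x) : x \in \mathcal{X}\} \equiv (l(x_1), l(x_2), \ldots, l(x_{|\mathcal{X}|})) \in \mathbb{Z}_+^{|\mathcal{X}|}$.
Since a function $l : \mathcal{X} \to \mathbb{Z}_+$ is the length function
of some prefix code if and only if it satisfies the Kraft inequality,
the admissible set of codeword length vectors is defined by

$$\mathcal{L}(\mathbb{Z}_+^{|\mathcal{X}|}) \triangleq \Big\{ \mathbf{l} \in \mathbb{Z}_+^{|\mathcal{X}|} : \sum_{x \in \mathcal{X}} D^{-l(x)} \le 1 \Big\}.$$

On the other hand, if the integer constraint is relaxed by
admitting real-valued length vectors
$\mathbf{l} \in \mathbb{R}_+^{|\mathcal{X}|}$ which satisfy the
Kraft inequality, such as Shannon codes or arithmetic codes,
then $\mathcal{L}(\mathbb{Z}_+^{|\mathcal{X}|})$ is replaced by

$$\mathcal{L}(\mathbb{R}_+^{|\mathcal{X}|}) \triangleq \Big\{ \mathbf{l} \in \mathbb{R}_+^{|\mathcal{X}|} : \sum_{x \in \mathcal{X}} D^{-l(x)} \le 1 \Big\}.$$

Without loss of generality it is assumed that the set of
probability distributions is defined by

$$\mathbb{P}(\mathcal{X}) \triangleq \Big\{ \mathbf{p} = (p(x_1), \ldots, p(x_{|\mathcal{X}|})) \in \mathbb{R}_+^{|\mathcal{X}|} : p(x_{|\mathcal{X}|}) > 0,\; p(x_i) \le p(x_j),\; \forall i > j,\; (x_i, x_j) \in \mathcal{X},\; \sum_{x \in \mathcal{X}} p(x) = 1 \Big\}.$$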

Moreover, $\log(\cdot) \triangleq \log_D(\cdot)$ and $\mathbb{H}(\mathbf{p})$
denotes the entropy of the probability distribution $\mathbf{p}$. The two
main problems investigated are the following.



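Problem 1. Given a known source probability vector
$\mathbf{p} \in \mathbb{P}(\mathcal{X})$ define the one parameter pay-off

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) \triangleq \Big\{ \alpha \max_{x \in \mathcal{X}} l(x) + (1-\alpha) \sum_{x \in \mathcal{X}} l(x) p(x) \Big\}, \tag{1}$$

and a slightly more general version representing redundancy

$$\mathbb{LR}^{MO}_{\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p}) \triangleq \alpha \max_{x \in \mathcal{X}} \big( l(x) + \log p(x) \big) + (1-\alpha) \Big( \sum_{x \in \mathcal{X}} l(x) p(x) - \mathbb{H}(\mathbf{p}) \Big), \tag{2}$$

where $\alpha \in [0,1]$ is a weighting parameter. The objective is to
find a prefix code length vector
$\mathbf{l}^{\dagger} \in \mathbb{R}_+^{|\mathcal{X}|}$ which minimizes the
pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$ or
$\mathbb{LR}^{MO}_{\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$, for all
$\alpha \in [0,1]$.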

The pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$ is a convex
combination of the maximum and the average codeword length, and hence
$\alpha$ weights how much emphasis is placed on the maximum relative to
the average codeword length. In the extreme cases, $\alpha = 0$ corresponds
to the average codeword length, and $\alpha = 1$ corresponds to the
maximum codeword length. The pay-off
$\mathbb{LR}^{MO}_{\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$ is
a convex combination of the maximum pointwise redundancy
and the average redundancy of the codeword length. The maximum
pointwise redundancy is the maximum difference
between the length of the compressed symbol $l(x)$ and the
self-information of that symbol $-\log p(x)$, hence this maximum
redundancy is minimized over the code lengths. To the best
of our knowledge neither of the pay-offs defined in Problem 1 is
addressed in the literature. Another class of problems which
is also not discussed in the literature is the following.

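Problem 2. Given a known source probability vector
$\mathbf{p} \in \mathbb{P}(\mathcal{X})$ define the two parameter pay-off

$$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p}) \triangleq \frac{\alpha}{t} \log \Big( \sum_{x \in \mathcal{X}} p(x) D^{t\, l(x)} \Big) + (1-\alpha) \sum_{x \in \mathcal{X}} l(x) p(x), \tag{3}$$

and a slightly more general version representing redundancy

$$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p}) \triangleq \frac{\alpha}{t} \log \Big( \sum_{x \in \mathcal{X}} p(x) D^{t\,(l(x) + \log p(x))} \Big) + (1-\alpha) \Big( \sum_{x \in \mathcal{X}} l(x) p(x) - \mathbb{H}(\mathbf{p}) \Big), \tag{4}$$

where $\alpha \in [0,1]$ is a weighting parameter and $t \in (-\infty, \infty)$.
The objective is to find a prefix code length vector
$\mathbf{l}^{\dagger} \in \mathbb{R}_+^{|\mathcal{X}|}$ which minimizes the
pay-off $\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ or
$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$, for all
$\alpha \in [0,1]$.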

The two parameter pay-off $\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$
is a convex combination of the average of an exponential function of the
codeword length and the average codeword length. The pay-off
$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$ is a
convex combination of the average of an exponential function of the
pointwise redundancy and the average pointwise redundancy. For
$\alpha = 0$ or $\alpha = 1$ the resulting special cases of Problem 2 are
found in [2]-[7].
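As a hedged illustration of pay-offs (1)-(3), the following minimal Python sketch evaluates them for a given length vector. The function names, the default `D = 2`, and the dyadic test distribution are assumptions made here for illustration and do not appear in the paper; pay-off (3) is evaluated only for $t \neq 0$.

```python
import math

def average_length(lengths, probs):
    """Average codeword length: sum_x l(x) p(x)."""
    return sum(l * p for l, p in zip(lengths, probs))

def entropy(probs, D=2):
    """Entropy of p with logarithms taken to base D."""
    return -sum(p * math.log(p, D) for p in probs if p > 0)

def payoff_mo(lengths, probs, alpha):
    """Pay-off (1): alpha * max length + (1 - alpha) * average length."""
    return alpha * max(lengths) + (1 - alpha) * average_length(lengths, probs)

def payoff_mo_redundancy(lengths, probs, alpha, D=2):
    """Pay-off (2): the same combination applied to pointwise redundancy."""
    pointwise = [l + math.log(p, D) for l, p in zip(lengths, probs)]
    avg_red = average_length(lengths, probs) - entropy(probs, D)
    return alpha * max(pointwise) + (1 - alpha) * avg_red

def payoff_mo_exp(lengths, probs, alpha, t, D=2):
    """Pay-off (3) for t != 0: (alpha/t) log_D sum_x p(x) D^{t l(x)} + (1-alpha) avg."""
    exp_sum = sum(p * D ** (t * l) for l, p in zip(lengths, probs))
    return (alpha / t) * math.log(exp_sum, D) + (1 - alpha) * average_length(lengths, probs)
```

For a dyadic source with Shannon lengths, setting `alpha = 0` in any of these functions returns the average-length (or average-redundancy) criterion, while `alpha = 1` in `payoff_mo` returns the maximum length.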



Hence, for $\alpha = 0$ or $\alpha = 1$ Problem 1 and Problem 2
are related to several problems previously investigated in the
literature. The special cases
$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})|_{\alpha=1}$ and
$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})|_{\alpha=1}$
are also the dual problems of universal coding problems
formulated as a minimax, in which the maximization is over
a class of probability distributions which satisfy a relative
entropy constraint with respect to a given fixed nominal
probability distribution (see [12]).

Moreover, for any $\alpha \in (0,1)$ Problem 1 and Problem 2
are multiobjective problems; clearly as $\alpha$ moves away from
$\alpha = 0$ more emphasis will be put on minimizing the maximum
codeword length or maximum pointwise redundancy for Problem 1,
and the exponential function of the codeword length
or pointwise redundancy for Problem 2. Relations between
Problem 1 and Problem 2 and other pay-offs are established
by noticing the validity of the following limits (which can be
easily shown).

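$$\lim_{t \to \infty} \frac{1}{t} \log \Big( \sum_{x \in \mathcal{X}} p(x) D^{t\, l(x)} \Big) = \max_{x \in \mathcal{X}} l(x) \tag{5}$$

$$\mathbb{L}^{MO}_{\infty,\alpha}(\mathbf{l},\mathbf{p}) \triangleq \lim_{t \to \infty} \mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p}) = \mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) \tag{6}$$

$$\mathbb{LR}^{MO}_{\infty,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p}) \triangleq \lim_{t \to \infty} \mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p}) = \mathbb{LR}^{MO}_{\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p}) \tag{7}$$

Since the multiobjective pay-off
$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ is in the limit, as
$t \to \infty$, equivalent to
$\lim_{t \to \infty} \mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p}) = \mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$,
$\forall \alpha \in [0,1]$, the codeword length vector minimizing
$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ is expected to converge
in the limit as $t \to \infty$ to that which minimizes
$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$. A similar behavior holds
for the multiobjective pay-off
$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$.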
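The limit (5) can be observed numerically. The sketch below is an illustration under assumptions made here (the function name, `D = 2`, positive `t`, and the test values are not from the paper); it factors $D^{t \max_x l(x)}$ out of the sum so that large $t$ does not overflow.

```python
import math

def exp_average_length(lengths, probs, t, D=2):
    """(1/t) log_D sum_x p(x) D^{t l(x)} for t > 0.

    The largest term D^{t * l_max} is factored out, so every remaining
    exponent is non-positive and the sum stays within floating-point range.
    """
    l_max = max(lengths)
    scaled = sum(p * D ** (t * (l - l_max)) for l, p in zip(lengths, probs))
    return l_max + math.log(scaled, D) / t

lengths = [1, 2, 3, 3]
probs = [0.5, 0.25, 0.125, 0.125]
for t in (1, 10, 200):
    # The value grows with t and approaches max(lengths) = 3 from below.
    print(t, exp_average_length(lengths, probs, t))
```

The gap to the maximum length is $-\frac{1}{t}\log_D$ of the probability mass on the longest codewords (plus vanishing terms), so it shrinks like $1/t$.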

II. Problem 1: Optimal Weights and Merging Rule

The objective of this section is to convert the multiobjective
pay-off of Problem 1 into one which is equivalent to a single
objective of the form $\sum_{x \in \mathcal{X}} w_{\alpha}(x) l(x)$, in
which $w_{\alpha}(x)$, $x \in \mathcal{X}$, are the new weights which
depend continuously on the parameter $\alpha \in [0,1]$. Subsequently, we
derive certain properties of these weights associated with the optimal
codeword lengths. The main issue here is to identify the combination
rule of merging symbols together, and how this combination
rule will change as a function of the parameter $\alpha \in [0,1]$
so that a solution exists over $[0,1]$. From these properties the
Shannon codeword lengths for Problem 1 will be found.

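Define
$l^* \triangleq \max_{x \in \mathcal{X}} l(x)$ and
$\mathcal{U} \triangleq \{ x \in \mathcal{X} : l(x) = l^* \}$.

Then, the pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$ can be written as

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) = \alpha l^* + (1-\alpha) \sum_{x \in \mathcal{X}} l(x) p(x) = \Big( \alpha + (1-\alpha) \sum_{x \in \mathcal{U}} p(x) \Big) l^* + \sum_{x \notin \mathcal{U}} (1-\alpha) p(x) l(x),$$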



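where the set $\mathcal{U}$ remains to be identified. Define

$$\sum_{x \in \mathcal{U}} w_{\alpha}(x) \triangleq \alpha + (1-\alpha) \sum_{x \in \mathcal{U}} p(x), \qquad w_{\alpha}(x) \triangleq (1-\alpha) p(x), \quad x \notin \mathcal{U}.$$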
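Then the pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$ can be written as follows:

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) = \mathbb{L}^{MO}(\mathbf{l},\mathbf{w}_{\alpha}) \triangleq \sum_{x \in \mathcal{X}} w_{\alpha}(x) l(x), \quad \forall \alpha \in [0,1], \tag{8}$$

where the weights $w_{\alpha}(x)$ are functions of $\alpha$ and the source
probability $\mathbf{p} \in \mathbb{P}(\mathcal{X})$. It can be easily
verified that the new weight vector
$\mathbf{w}_{\alpha} \triangleq \{ w_{\alpha}(x) : x \in \mathcal{X} \}$ is
a probability distribution since $0 \le w_{\alpha}(x) \le 1$,
$\forall x \in \mathcal{X}$, and
$\sum_{x \in \mathcal{X}} w_{\alpha}(x) = 1$, $\forall \alpha \in [0,1]$.
The next lemma describes how the weight vector behaves as a function of
the probability vector $\mathbf{p}$ and $\alpha \in [0,1]$.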

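Lemma 1. Consider pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$.
Given any probability distribution $\mathbf{p} \in \mathbb{P}(\mathcal{X})$
the following hold.

1. If $p(x) \le p(y)$, then $w_{\alpha}(x) \le w_{\alpha}(y)$,
$\forall x, y \in \mathcal{X}$, $\alpha \in [0,1]$. Equivalently,
$w_{\alpha}(x_1) \ge w_{\alpha}(x_2) \ge \cdots \ge w_{\alpha}(x_{|\mathcal{X}|}) > 0$,
for all $\alpha \in [0,1]$.

2. For $y \notin \mathcal{U}$, $w_{\alpha}(y)$ is a monotonically decreasing
function of $\alpha \in [0,1]$, and for $x \in \mathcal{U}$, $w_{\alpha}(x)$
is a monotonically increasing function of $\alpha \in [0,1]$.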

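Proof: There exist three cases; more specifically,
1) $x, y \notin \mathcal{U}$: then
$w_{\alpha}(x) = (1-\alpha) p(x) \le (1-\alpha) p(y) = w_{\alpha}(y)$,
$\forall \alpha \in [0,1]$;
2) $x, y \in \mathcal{U}$:
$w_{\alpha}(x) = w_{\alpha}(y) = w^*_{\alpha} \triangleq \min_{x \in \mathcal{X}} w_{\alpha}(x)$;
3) $x \in \mathcal{U}$, $y \notin \mathcal{U}$ (or $x \notin \mathcal{U}$,
$y \in \mathcal{U}$): consider the case $x \in \mathcal{U}$ and
$y \notin \mathcal{U}$. Then,

$$\frac{\partial w_{\alpha}(y)}{\partial \alpha} = -p(y) < 0, \tag{9}$$

$$\frac{\partial w_{\alpha}(x)}{\partial \alpha} = \frac{\partial w^*_{\alpha}}{\partial \alpha} = \frac{1}{|\mathcal{U}|} \Big( 1 - \sum_{x \in \mathcal{U}} p(x) \Big) > 0. \tag{10}$$

According to (9), (10), for $y \notin \mathcal{U}$ the weight
$w_{\alpha}(y)$ decreases, and for $x \in \mathcal{U}$ the weight
$w_{\alpha}(x)$ increases. Hence, since $w_{\alpha}(\cdot)$
is a continuous function with respect to $\alpha$, at some
$\alpha = \alpha'$,
$w_{\alpha'}(x) = w_{\alpha'}(y) = w^*_{\alpha'}$. Suppose that for some
$\alpha = \alpha' + d\alpha$, $d\alpha > 0$,
$w_{\alpha}(x) \ne w_{\alpha}(y)$. Then, the largest weight will
decrease and the lowest weight will increase as a function of
$\alpha \in [0,1]$ according to (9) and (10), respectively. ∎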

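Remark 1. (Special Case) Before deriving the general coding
algorithm, consider the simplest case when $|\mathcal{U}| = 1$, that is
$w_{\alpha}(x_{|\mathcal{X}|}) < w_{\alpha}(x_{|\mathcal{X}|-1})$. Then,

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) = \big( \alpha + (1-\alpha) p(x_{|\mathcal{X}|}) \big) l^* + \sum_{x \notin \mathcal{U}} (1-\alpha) p(x) l(x).$$

In this case, the weights are given by
$w_{\alpha}(x) = (1-\alpha) p(x)$, $x \notin \mathcal{U}$, and
$w_{\alpha}(x_{|\mathcal{X}|}) = \alpha + (1-\alpha) p(x_{|\mathcal{X}|})$.
This formulation is identical to the minimum expected length problem
provided $\alpha \in [0,1]$ is such that
$w_{\alpha}(x_{|\mathcal{X}|}) < w_{\alpha}(x_{|\mathcal{X}|-1})$.
Hence, for any $\alpha \in [0, \alpha_1)$, with $\alpha_1$ defined by

$$\alpha_1 \triangleq \frac{p_{|\mathcal{X}|-1} - p_{|\mathcal{X}|}}{1 + p_{|\mathcal{X}|-1} - p_{|\mathcal{X}|}}, \tag{11}$$

the codeword lengths are given by $-\log w_{\alpha}(x)$,
$x \in \mathcal{X}$. For $\alpha > \alpha_1$ the form of the minimization
problem changes, as more weights $w_{\alpha}(x)$ are such that
$x \in \mathcal{U}$. The merging rule on the weight vector
$\mathbf{w}_{\alpha}$, so that a solution to the coding problem exists for
arbitrary cardinality $|\mathcal{U}|$ and any $\alpha \in [0,1]$, is
described next.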
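The single-merge case of Remark 1 can be sketched in a few lines. The code below is an illustration under assumptions made here: the function names are hypothetical, `p` is assumed sorted in non-increasing order as in the definition of the admissible distributions, and the distribution used in the usage lines is chosen only as an example. Exact rational arithmetic is used so that the breakpoint (11) can be compared without rounding.

```python
from fractions import Fraction

def alpha_1(p):
    """First breakpoint (11): where the smallest weight meets the second smallest."""
    gap = p[-2] - p[-1]
    return gap / (1 + gap)

def weights_single_merge(p, alpha):
    """Weights when U contains only the least likely symbol (alpha <= alpha_1)."""
    w = [(1 - alpha) * px for px in p]
    w[-1] = alpha + (1 - alpha) * p[-1]  # the last symbol absorbs the mass alpha
    return w

# Example distribution (an assumption for illustration).
p = [Fraction(8, 15), Fraction(4, 15), Fraction(2, 15), Fraction(1, 15)]
a1 = alpha_1(p)
w_at_a1 = weights_single_merge(p, a1)
```

At `alpha = a1` the last two entries of `w_at_a1` coincide, which is the point where the second-smallest symbol joins the set of longest codewords.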

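Consider the general case when
$|\mathcal{U}| \in \{1, 2, \ldots, |\mathcal{X}|-1\}$.
Define $\alpha_0 \triangleq 0$ and

$$\alpha_k \triangleq \min \big\{ \alpha \in [0,1] : w_{\alpha}(x_{|\mathcal{X}|-(k-1)}) = w_{\alpha}(x_{|\mathcal{X}|-k}) \big\}, \qquad \Delta\alpha_k \triangleq \alpha_{k+1} - \alpha_k, \quad k \in \{1, \ldots, |\mathcal{X}|-1\}.$$

That is, since the weights are ordered as in Lemma 1, $\alpha_1$
is the smallest value of $\alpha \in [0,1]$ for which the smallest two
weights are equal,
$w_{\alpha}(x_{|\mathcal{X}|}) = w_{\alpha}(x_{|\mathcal{X}|-1})$,
$\alpha_2$ is the smallest value of $\alpha \in [0,1]$ for which the next
smallest two weights are equal,
$w_{\alpha}(x_{|\mathcal{X}|-1}) = w_{\alpha}(x_{|\mathcal{X}|-2})$, etc.,
and $\alpha_{|\mathcal{X}|-1}$ is the smallest value of
$\alpha \in [0,1]$ for which the biggest two weights
are equal, $w_{\alpha}(x_2) = w_{\alpha}(x_1)$. For a given value of
$\alpha \in [0,1]$, we define the minimum weight corresponding to a
specific symbol in $\mathcal{X}$ by
$w^*_{\alpha} \triangleq \min_{x \in \mathcal{X}} w_{\alpha}(x)$.

Since for $k = 0$, $w_{\alpha_0}(x) = w_0(x) = p(x)$,
$\forall x \in \mathcal{X}$, is the set of initial symbol probabilities,
let $\mathcal{U}_0$ denote the singleton set $\{x_{|\mathcal{X}|}\}$.
Specifically,

$$\mathcal{U}_0 \triangleq \Big\{ x \in \{x_{|\mathcal{X}|}\} : p^* \triangleq \min_{x \in \mathcal{X}} p(x) = p(x_{|\mathcal{X}|}) \Big\}. \tag{12}$$

Similarly, $\mathcal{U}_1$ is defined as the set of symbols in
$\{x_{|\mathcal{X}|-1}, x_{|\mathcal{X}|}\}$ whose weight evaluated at
$\alpha_1$ is equal to the minimum weight $w^*_{\alpha_1}$:

$$\mathcal{U}_1 \triangleq \big\{ x \in \{x_{|\mathcal{X}|-1}, x_{|\mathcal{X}|}\} : w_{\alpha_1}(x) = w^*_{\alpha_1} \big\}. \tag{13}$$

In general, for a given value of $\alpha_k$,
$k \in \{1, \ldots, |\mathcal{X}|-1\}$, we define

$$\mathcal{U}_k \triangleq \big\{ x \in \{x_{|\mathcal{X}|-k}, \ldots, x_{|\mathcal{X}|}\} : w_{\alpha_k}(x) = w^*_{\alpha_k} \big\}. \tag{14}$$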

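Lemma 2. Consider pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$.
For any probability distribution $\mathbf{p} \in \mathbb{P}(\mathcal{X})$
and $\alpha \in [\alpha_k, \alpha_{k+1}) \subset [0,1]$,
$k \in \{0, 1, 2, \ldots, |\mathcal{X}|-1\}$,

$$w_{\alpha}(x_{|\mathcal{X}|-k}) = \cdots = w_{\alpha}(x_{|\mathcal{X}|}) = w^*_{\alpha}, \tag{15}$$

and the cardinality of the set $\mathcal{U}_k$ is $|\mathcal{U}_k| = k + 1$.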



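Proof: The validity of the statement is shown by perfect
induction. At $\alpha = \alpha_1$,
$w_{\alpha}(x_{|\mathcal{X}|}) = w_{\alpha}(x_{|\mathcal{X}|-1}) \le w_{\alpha}(x_{|\mathcal{X}|-2}) \le \cdots \le w_{\alpha}(x_1)$.
Suppose that, when $\alpha = \alpha_1 + d\alpha$, $d\alpha > 0$, then
$w_{\alpha}(x_{|\mathcal{X}|}) \ne w_{\alpha}(x_{|\mathcal{X}|-1})$. Then,

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) = \big( \alpha + (1-\alpha) p(y) \big) l^* + \sum_{x \notin \mathcal{U}} (1-\alpha) p(x) l(x)$$

and the weights will be of the form $w_{\alpha}(x) = (1-\alpha) p(x)$ and
$w_{\alpha}(y) = \alpha + (1-\alpha) p(y)$, where
$y \in \{x_{|\mathcal{X}|}, x_{|\mathcal{X}|-1}\}$. Thus,

$$\frac{\partial w_{\alpha}(x)}{\partial \alpha} = -p(x) < 0, \quad x \notin \mathcal{U}, \tag{16}$$

$$\frac{\partial w_{\alpha}(y)}{\partial \alpha} = 1 - p(y) > 0, \quad y \in \mathcal{U}. \tag{17}$$

Hence, the largest of the two would decrease, while the
smallest would increase and therefore they meet again. This
contradicts our assumption that
$w_{\alpha}(x_{|\mathcal{X}|}) \ne w_{\alpha}(x_{|\mathcal{X}|-1})$ for
$\alpha > \alpha_1$. Therefore,
$w_{\alpha}(x_{|\mathcal{X}|}) = w_{\alpha}(x_{|\mathcal{X}|-1})$,
$\forall \alpha \in [\alpha_1, 1)$.

Secondly, in the case that $\alpha > \alpha_k$,
$k \in \{2, \ldots, |\mathcal{X}|-1\}$, we suppose that the weights
$w_{\alpha}(x_{|\mathcal{X}|}) = w_{\alpha}(x_{|\mathcal{X}|-1}) = \cdots = w_{\alpha}(x_{|\mathcal{X}|-k}) = w^*_{\alpha}$.
Hence, the pay-off is written as

$$\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p}) = \Big( \alpha + (1-\alpha) \sum_{x \in \mathcal{U}_k} p(x) \Big) l^* + \sum_{x \notin \mathcal{U}_k} (1-\alpha) p(x) l(x).$$

Thus,

$$\frac{\partial w_{\alpha}(x)}{\partial \alpha} = -p(x) < 0, \quad x \notin \mathcal{U}_k, \tag{18}$$

$$|\mathcal{U}_k| \frac{\partial w^*_{\alpha}}{\partial \alpha} = 1 - \sum_{j=0}^{k} p_{|\mathcal{X}|-j} > 0, \quad k \in \{2, \ldots, |\mathcal{X}|-1\}. \tag{19}$$

Finally, in the case that $\alpha > \alpha_{k+1}$,
$k \in \{2, \ldots, |\mathcal{X}|-2\}$, if any of the weights
$w_{\alpha}(x_{|\mathcal{X}|-j})$, $\forall j \in \{0, \ldots, k+1\}$,
changes differently than another, then either at least one probability
will become smaller than the others and give a higher codeword
length, or it will increase faster than the others and hence
according to (18) it will decrease to meet the other weights.
Therefore, the change in this new set of probabilities should
be the same, and the cardinality of $\mathcal{U}$ increases by one, i.e.,
$|\mathcal{U}_{k+1}| = k + 2$, $k \in \{2, \ldots, |\mathcal{X}|-2\}$. ∎

The next theorem describes how the weight vector $\mathbf{w}_{\alpha}$
changes as a function of $\alpha \in [0,1]$ so that a solution
to the coding problem exists.

Theorem 1. Consider pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$.
Given a set of probabilities $\mathbf{p} \in \mathbb{P}(\mathcal{X})$ and
$\alpha \in [\alpha_k, \alpha_{k+1})$,
$k \in \{0, 1, \ldots, |\mathcal{X}|-1\}$, the optimal weights
$\mathbf{w}^{\dagger}_{\alpha} \triangleq \{ w^{\dagger}_{\alpha}(x) : x \in \mathcal{X} \} \equiv (w^{\dagger}_{\alpha}(x_1), w^{\dagger}_{\alpha}(x_2), \ldots, w^{\dagger}_{\alpha}(x_{|\mathcal{X}|}))$
are given by

$$w^{\dagger}_{\alpha}(x) = \begin{cases} (1-\alpha) p(x), & x \notin \mathcal{U}_k \\ w^*_{\alpha_k}(x) + (\alpha - \alpha_k) \dfrac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|}, & x \in \mathcal{U}_k \end{cases} \tag{20}$$

where $\mathcal{U}_k$ is given by (14) and

$$\alpha_{k+1} = \alpha_k + (1-\alpha_k) \frac{p_{|\mathcal{X}|-(k+1)} - p_{|\mathcal{X}|-k}}{\dfrac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|} + p_{|\mathcal{X}|-(k+1)}}. \tag{21}$$

Proof: According to Lemma 2, the lowest probabilities
become equal and change together, forming a total weight
given by

$$\sum_{j=0}^{k} w_{\alpha}(x_{|\mathcal{X}|-j}) = |\mathcal{U}_k|\, w^{\dagger}_{\alpha}(x) = \alpha + (1-\alpha) p_{|\mathcal{X}|} + \ldots + (1-\alpha) p_{|\mathcal{X}|-k}.$$

Hence,

$$|\mathcal{U}_k| \frac{\partial w^{\dagger}_{\alpha}(x)}{\partial \alpha} = 1 - \sum_{j=0}^{k} p(x_{|\mathcal{X}|-j}), \tag{22}$$

$$\frac{\partial w^{\dagger}_{\alpha}(x)}{\partial \alpha} = \frac{1 - \sum_{j=0}^{k} p(x_{|\mathcal{X}|-j})}{|\mathcal{U}_k|} = \frac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|}. \tag{23}$$

By letting $\delta_k(\alpha) \triangleq \alpha - \alpha_k$, then
$\forall x \in \mathcal{U}_k$

$$w^*_{\alpha}(x) = w^*_{\alpha_k}(x) + \delta_k(\alpha) \frac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|}, \tag{24}$$

whereas $\forall x \notin \mathcal{U}_k$,
$w_{\alpha}(x) = (1-\alpha) p(x)$. When
$\delta_k(\alpha) = \alpha_{k+1} - \alpha_k$, that is
$\alpha = \alpha_{k+1}$, then
$w_{\alpha}(x_{|\mathcal{X}|-(k+1)}) = w^*_{\alpha}(x)$ and therefore

$$(1-\alpha_{k+1})\, p(x_{|\mathcal{X}|-(k+1)}) = w^*_{\alpha_k}(x) + \delta_k(\alpha_{k+1}) \frac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|}.$$

Using $w^*_{\alpha_k}(x) = (1-\alpha_k)\, p(x_{|\mathcal{X}|-k})$, after
manipulation $\alpha_{k+1}$ is given by

$$\alpha_{k+1} = \alpha_k + (1-\alpha_k) \frac{p(x_{|\mathcal{X}|-(k+1)}) - p(x_{|\mathcal{X}|-k})}{\dfrac{\sum_{x \notin \mathcal{U}_k} p(x)}{|\mathcal{U}_k|} + p_{|\mathcal{X}|-(k+1)}}. \tag{25}$$

∎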
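The recursion (21) for the merging points can be iterated directly. The following sketch is an illustration under stated assumptions: the function names are hypothetical, `p` is assumed sorted in non-increasing order with strictly positive entries summing to one, and the closed-form cross-check is derived here (by equating $(1-\alpha)p(x_{|\mathcal{X}|-(k+1)})$ with the merged weight $(\alpha + (1-\alpha)\sum_{x \in \mathcal{U}_k} p(x))/(k+1)$) rather than taken from the paper.

```python
from fractions import Fraction

def breakpoints(p):
    """Return [alpha_0, alpha_1, ..., alpha_{n-1}] computed from recursion (21)."""
    n = len(p)
    alphas = [Fraction(0)]
    for k in range(n - 1):
        # U_k holds the last k+1 symbols, so the mass outside U_k is the
        # sum of the first n-k-1 probabilities.
        outside = sum(p[: n - k - 1], Fraction(0))
        p_next = p[n - k - 2]  # p(x_{|X|-(k+1)}), 0-based index
        p_last = p[n - k - 1]  # p(x_{|X|-k}), 0-based index
        a_k = alphas[-1]
        step = (1 - a_k) * (p_next - p_last) / (outside / (k + 1) + p_next)
        alphas.append(a_k + step)
    return alphas

def breakpoint_closed_form(p, k):
    """alpha_{k+1} from directly equating the next weight with the merged weight."""
    n = len(p)
    mass_u = sum(p[n - k - 1:], Fraction(0))  # total probability of U_k
    g = (k + 1) * p[n - k - 2] - mass_u
    return g / (1 + g)

# Example distribution (an assumption for illustration).
p = [Fraction(8, 15), Fraction(4, 15), Fraction(2, 15), Fraction(1, 15)]
alphas = breakpoints(p)
```

For this example the two routes agree, and the breakpoints are increasing, which matches the picture of symbols joining $\mathcal{U}$ one at a time as $\alpha$ grows.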



III. Optimal Code Lengths

This section presents the optimal real-valued codeword length
vectors $\mathbf{l} \in \mathcal{L}(\mathbb{R}_+^{|\mathcal{X}|})$ of the
multiobjective pay-offs stated under Problem 1 and Problem 2 for any
$\alpha \in [0,1]$ and $t \in [0, \infty)$.



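Theorem 2. Consider Problem 1. For any probability distribution
$\mathbf{p} \in \mathbb{P}(\mathcal{X})$ and $\alpha \in [0,1]$ the optimal
prefix real-valued code
$\mathbf{l}^{\dagger} \in \mathbb{R}_+^{|\mathcal{X}|}$ minimizing the
pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$ is given by

$$l^{\dagger}_{\alpha}(x) = \begin{cases} -\log \big( (1-\alpha) p(x) \big) & \text{for } x \notin \mathcal{U}_k \\ -\log \Big( \dfrac{\alpha + (1-\alpha) \sum_{x \in \mathcal{U}_k} p(x)}{|\mathcal{U}_k|} \Big) & \text{for } x \in \mathcal{U}_k \end{cases}$$

where $\alpha \in [\alpha_k, \alpha_{k+1}) \subset [0,1]$,
$\forall k \in \{1, \ldots, |\mathcal{X}|-1\}$.

Proof: The pay-off to be minimized is given by (8). It can
be easily verified that the new weight vector
$\mathbf{w}_{\alpha} \triangleq \{ w_{\alpha}(x) : x \in \mathcal{X} \}$ is
a probability distribution since $0 \le w_{\alpha}(x) \le 1$,
$\forall x \in \mathcal{X}$, and
$\sum_{x \in \mathcal{X}} w_{\alpha}(x) = 1$, $\forall \alpha \in [0,1]$.
Therefore, as in Shannon coding the optimal codeword lengths are given by
minus the logarithm of the optimal weights. ∎

Note that for $\alpha = 0$ Theorem 2 corresponds to the Shannon
solution $l^{sh}(x) = -\log p(x)$, while the solution for $\alpha = 1$
is the same as the solution for all $\alpha$ taking values in the interval
$\alpha \in [\alpha_{|\mathcal{X}|-1}, 1]$ over which the weight vector is
identically distributed, and hence
$l^{\dagger}_{\alpha}(x)|_{\alpha=1} = \log |\mathcal{X}|$. The behavior of
$w_{\alpha}(x)$ and $l^{\dagger}_{\alpha}(x)$ as a function of
$\alpha \in [0,1]$ is described in the next subsection via an illustrative
example. The solution of the multiobjective pay-off
$\mathbb{LR}^{MO}_{\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$, which
involves the pointwise redundancy, is omitted since it is characterized
similarly.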
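A minimal sketch of how the real-valued lengths of Theorem 2 might be evaluated is given below. It is written under assumptions made here: the function name is hypothetical, `p` is sorted in non-increasing order, and the merged set is located by growing $\mathcal{U}$ until the merged weight no longer exceeds the next unmerged weight, which is one way of finding the interval $[\alpha_k, \alpha_{k+1})$ and is not necessarily how the authors compute it.

```python
import math

def optimal_lengths(p, alpha, D=2):
    """Real-valued lengths -log_D of the merged weight vector for Problem 1."""
    n = len(p)
    for k in range(n):
        # Weight shared by the k+1 least likely symbols if they are merged.
        merged = (alpha + (1 - alpha) * sum(p[n - k - 1:])) / (k + 1)
        # Stop once the merged weight is no larger than the next unmerged weight.
        if k == n - 1 or merged <= (1 - alpha) * p[n - k - 2]:
            break
    weights = [(1 - alpha) * px for px in p[: n - k - 1]] + [merged] * (k + 1)
    return [-math.log(w, D) for w in weights]

# Example distribution (an assumption for illustration).
p = [8 / 15, 4 / 15, 2 / 15, 1 / 15]
for alpha in (0.0, 1 / 16, 0.5, 1.0):
    lengths = optimal_lengths(p, alpha)
    kraft = sum(2 ** -l for l in lengths)
    print(alpha, [round(l, 4) for l in lengths], round(kraft, 12))
```

Because the weights form a probability distribution, the Kraft sum is one for every $\alpha$, and the maximum length shrinks from $-\log p(x_{|\mathcal{X}|})$ at $\alpha = 0$ towards $\log |\mathcal{X}|$ at $\alpha = 1$.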

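Theorem 3. Consider Problem 2. For any probability distribution
$\mathbf{p} \in \mathbb{P}(\mathcal{X})$ and $\alpha \in [0,1]$ the optimal
prefix real-valued code
$\mathbf{l}^{\dagger} \in \mathbb{R}_+^{|\mathcal{X}|}$ minimizing the
pay-off $\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ is given by

$$l^{\dagger}_{t,\alpha}(x) = -\log \big( \alpha \nu_{t,\alpha}(x) + (1-\alpha) p(x) \big), \quad x \in \mathcal{X}, \tag{26}$$

where $\{ \nu_{t,\alpha}(x) : x \in \mathcal{X} \}$ is defined via the
tilted probability distribution

$$\nu_{t,\alpha}(x) \triangleq \frac{D^{t\, l^{\dagger}_{t,\alpha}(x)}\, p(x)}{\sum_{x \in \mathcal{X}} p(x)\, D^{t\, l^{\dagger}_{t,\alpha}(x)}}, \quad x \in \mathcal{X}. \tag{27}$$

Proof: By invoking the Karush-Kuhn-Tucker necessary
and sufficient conditions of optimality one obtains the following
set of equations describing the optimal codeword lengths:

$$D^{-l^{\dagger}_{t,\alpha}(x)} = \alpha \nu_{t,\alpha}(x) + (1-\alpha) p(x), \quad x \in \mathcal{X}, \tag{28}$$

which gives (26). ∎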
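Note that the solution stated under Theorem 3 corresponds,
for $\alpha = 0$, to the Shannon code, which minimizes the
average codeword length pay-off, while for $\alpha = 1$ (after
manipulations) it is given by

$$l^{\dagger}_{t,\alpha}(x)\big|_{\alpha=1} = -\frac{1}{1+t} \log p(x) + \log \Big( \sum_{x \in \mathcal{X}} p(x)^{\frac{1}{1+t}} \Big), \quad x \in \mathcal{X},$$

which is precisely the solution of a variant of the Shannon
code, minimizing the average of an exponential function of
the codeword length pay-off [5], [6]. The solution of the
multiobjective pay-off
$\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$
corresponding to pointwise redundancy is obtained similarly as in
Theorem 3. The optimal codeword lengths are given by

$$l^{\dagger}_{t,\alpha}(x) = -\log \big( \alpha \mu_{t,\alpha}(x) + (1-\alpha) p(x) \big), \quad x \in \mathcal{X}, \tag{29}$$

where $\{ \mu_{t,\alpha}(x) : x \in \mathcal{X} \}$ is defined via the
tilted probability distribution

$$\mu_{t,\alpha}(x) \triangleq \frac{D^{t\, l^{\dagger}_{t,\alpha}(x)}\, p^{t+1}(x)}{\sum_{x \in \mathcal{X}} p^{t+1}(x)\, D^{t\, l^{\dagger}_{t,\alpha}(x)}}, \quad x \in \mathcal{X}. \tag{30}$$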



The only difference between the optimal codeword lengths
of pay-off $\mathbb{LR}^{MO}_{t,\alpha}(\mathbf{l}+\log\mathbf{p},\mathbf{p})$
with respect to the pay-off
$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ is the term
$p^{t+1}(x)$ appearing in the tilted distribution. When $\alpha = 1$, (29)
is precisely a variant of the Shannon code, minimizing the average of an
exponential function of the redundancy of the codeword length pay-off
[6], [7].
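The $\alpha = 1$ special case of Theorem 3 lends itself to a quick numerical consistency check. The sketch below is an illustration under assumptions made here: the function names and test distribution are hypothetical, `D = 2` is a default chosen for convenience, and the tilted distribution is read as $\nu(x) \propto p(x) D^{t\,l(x)}$. It evaluates the closed-form lengths, builds the tilted distribution from them, and checks that the fixed-point relation $D^{-l(x)} = \nu(x)$, which is (28) with $\alpha = 1$, holds.

```python
import math

def exp_cost_lengths(p, t, D=2):
    """alpha = 1 lengths: -log_D(p(x))/(1+t) + log_D(sum_x p(x)^(1/(1+t)))."""
    s = sum(px ** (1 / (1 + t)) for px in p)
    return [-math.log(px, D) / (1 + t) + math.log(s, D) for px in p]

def tilted(p, lengths, t, D=2):
    """Tilted distribution: nu(x) proportional to p(x) * D^(t * l(x))."""
    unnormalised = [px * D ** (t * l) for px, l in zip(p, lengths)]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]

p = [0.5, 0.25, 0.125, 0.125]
t = 1.0
lengths = exp_cost_lengths(p, t)
nu = tilted(p, lengths, t)
# Largest deviation from the fixed-point relation 2^{-l(x)} = nu(x).
residual = max(abs(2 ** -l - n) for l, n in zip(lengths, nu))
```

For general $\alpha \in (0,1)$ the lengths appear on both sides of (26)-(27), so they would have to be found numerically; this sketch deliberately covers only the case where a closed form is available.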

Remark 2. 1. The Limiting Case as $t \to \infty$: The minimization
of the multiobjective pay-off $\mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$
obtained in Theorem 2 is indeed obtained from the minimization of the two
parameter multiobjective pay-off
$\mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p})$ in the limit, as
$t \to \infty$. In addition,
$\lim_{t \to \infty} \mathbb{L}^{MO}_{t,\alpha}(\mathbf{l},\mathbf{p}) = \mathbb{L}^{MO}_{\alpha}(\mathbf{l},\mathbf{p})$,
$\forall \mathbf{l}$, and hence at $\mathbf{l} = \mathbf{l}^{\dagger}$. The
point to be made here is that the solution of
Problem 1 can be deduced from the solution of Problem 2 in
the limit as $t \to \infty$, provided the merging rule on how the
solution changes with $\alpha \in [0,1]$ is employed.
2. Coding Theorems: Although coding theorems for Problem 1
and Problem 2 are not presented (due to space limitation),
these can be easily obtained either from the closed form
solutions or by following 



A. An Algorithm for Computing the Optimal Weights 

For any probability distribution $\mathbf{p} \in \mathbb{P}(\mathcal{X})$
and $\alpha \in [0,1]$, an algorithm is presented to compute the optimal
weight vector $\mathbf{w}^{\dagger}_{\alpha}$.



[Figure: weights plotted against $\alpha \in [0,1]$, starting from $w_{\alpha_0}(x) = p(x)$ at $\alpha = 0$.]

Fig. 1. A schematic representation of the weights for different values of $\alpha$.

It is shown in Section II (see also Figure 1) that the weight
vector changes piecewise linearly as a function of
$\alpha \in [0,1]$. Therefore, to calculate the weights $w_{\alpha}(x)$ for
a specific value of $\alpha \in [0,1]$, one is only required to determine
the values of $\alpha$ at the intersections by using (21), up to the
intersection (see Fig. 1) that gives a value greater than $\alpha$ or up to
the last intersection (if all the intersections give a smaller value of
$\alpha$). Thus, one can easily find the weights at $\alpha$ by using (20).



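Algorithm 1 Algorithm for Computing the Weight Vector $\mathbf{w}^{\dagger}_{\alpha}$
initialize
    $\mathbf{p} = (p(x_1), p(x_2), \ldots, p(x_{|\mathcal{X}|}))$, $\alpha = \hat{\alpha}$
    $k = 0$, $\alpha_0 = 0$
while $k < |\mathcal{X}| - 1$ do
    Calculate $\alpha_{k+1}$:
        $\alpha_{k+1} = \alpha_k + (1-\alpha_k) \dfrac{p(x_{|\mathcal{X}|-(k+1)}) - p(x_{|\mathcal{X}|-k})}{\frac{\sum_{x \notin \mathcal{U}_k} p(x)}{k+1} + p(x_{|\mathcal{X}|-(k+1)})}$
    if $\alpha < \alpha_{k+1}$ then exit the loop
    $k \leftarrow k + 1$
end while
Calculate $w^{\dagger}_{\alpha}(x)$ for $x \notin \mathcal{U}_k$:
for $v = 1$ to $|\mathcal{X}| - (k+1)$ do
    $w^{\dagger}_{\alpha}(x_v) = (1-\alpha) p(x_v)$
end for
Calculate $w^{*}_{\alpha}(x)$ for $x \in \mathcal{U}_k$:
    $w^{*}_{\alpha}(x) = (1-\alpha_k)\, p(x_{|\mathcal{X}|-k}) + (\alpha - \alpha_k) \dfrac{\sum_{x \notin \mathcal{U}_k} p(x)}{k+1}$
for $v = |\mathcal{X}| - k$ to $|\mathcal{X}|$ do
    $w^{\dagger}_{\alpha}(x_v) = w^{*}_{\alpha}(x)$
end for
return $\mathbf{w}^{\dagger}_{\alpha}$.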
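Algorithm 1 translates almost line by line into Python. The following is a minimal sketch, not the authors' implementation: the function name is hypothetical, `p` is assumed sorted in non-increasing order with positive entries summing to one, and the loop is written so that it stops at the index $k$ with $\alpha_k \le \alpha < \alpha_{k+1}$ (or at $k = |\mathcal{X}|-1$ once every symbol has merged).

```python
from fractions import Fraction

def weight_vector(p, alpha):
    """Weights for Problem 1 at a given alpha, following the steps of Algorithm 1."""
    n = len(p)
    k, a_k = 0, 0
    while k < n - 1:
        outside = sum(p[: n - k - 1])          # probability mass outside U_k
        p_next, p_last = p[n - k - 2], p[n - k - 1]
        # Next merging point alpha_{k+1} from recursion (21).
        a_next = a_k + (1 - a_k) * (p_next - p_last) / (outside / (k + 1) + p_next)
        if alpha < a_next:
            break                              # alpha lies in [alpha_k, alpha_{k+1})
        k, a_k = k + 1, a_next
    outside = sum(p[: n - k - 1])
    # Common weight of the k+1 merged symbols, growing linearly from alpha_k.
    w_star = (1 - a_k) * p[n - k - 1] + (alpha - a_k) * outside / (k + 1)
    return [(1 - alpha) * px for px in p[: n - k - 1]] + [w_star] * (k + 1)

# Example distribution (an assumption for illustration).
p = [Fraction(8, 15), Fraction(4, 15), Fraction(2, 15), Fraction(1, 15)]
w = weight_vector(p, Fraction(1, 10))
```

With `Fraction` inputs the result is exact, so properties such as the weights summing to one can be checked without tolerance; floating-point inputs work as well.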



B. Illustrative Example 

Consider binary codewords and a source with $|\mathcal{X}| = 4$ and
probability distribution
$\mathbf{p} = \big( \frac{8}{15}, \frac{4}{15}, \frac{2}{15}, \frac{1}{15} \big)$.
Using the algorithm one can find the optimal weight vector
$\mathbf{w}^{\dagger}_{\alpha}$ for different values of $\alpha \in [0,1]$
for which pay-off (1) of Problem 1 is minimized. Compute $\alpha_1$ via
(21), $\alpha_1 = 1/16$. For $\alpha = \alpha_1 = 1/16$ the optimal weights
are

$$w^{\dagger}_4(\alpha) = w^{\dagger}_3(\alpha) = (1-\alpha) p_3 = \frac{1}{8}, \qquad w^{\dagger}_2(\alpha) = (1-\alpha) p_2 = \frac{1}{4}, \qquad w^{\dagger}_1(\alpha) = (1-\alpha) p_1 = \frac{1}{2}.$$

In this case, the resulting codeword lengths correspond to the
optimal Huffman code. The weights for all $\alpha \in [0,1]$ can be
calculated iteratively by calculating $\alpha_k$ for all
$k \in \{0, 1, 2, 3\}$ and noting that the weights vary linearly with
$\alpha$ (Figure 2).
Fig. 2. A schematic representation of the weights for different values of $\alpha$
when $\mathbf{p} = \big( \frac{8}{15}, \frac{4}{15}, \frac{2}{15}, \frac{1}{15} \big)$.
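The statement that the lengths at $\alpha_1$ coincide with a Huffman code can be checked with a standard Huffman length computation. The sketch below rests on assumptions made here: the distribution is taken to be $(8/15, 4/15, 2/15, 1/15)$, which is consistent with $\alpha_1 = 1/16$, and `huffman_lengths` is a textbook binary Huffman construction written for this illustration, not code from the paper.

```python
import heapq
import math
from fractions import Fraction

def huffman_lengths(p):
    """Codeword lengths of a binary Huffman code for the probabilities p."""
    # Heap entries: (probability, tie-breaker, indices of symbols in the subtree).
    heap = [(px, i, [i]) for i, px in enumerate(p)]
    heapq.heapify(heap)
    lengths = [0] * len(p)
    counter = len(p)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        for i in s1 + s2:
            lengths[i] += 1  # every symbol under the merged node gets one more bit
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths

p = [Fraction(8, 15), Fraction(4, 15), Fraction(2, 15), Fraction(1, 15)]
alpha_1 = Fraction(1, 16)
# Weights at alpha_1: the two least likely symbols share the weight (1 - alpha) p_3.
w = [(1 - alpha_1) * p[0], (1 - alpha_1) * p[1], (1 - alpha_1) * p[2], (1 - alpha_1) * p[2]]
ideal = [-math.log2(float(wx)) for wx in w]
huff = huffman_lengths(p)
```

Here the weights at $\alpha_1$ are dyadic, so the real-valued lengths `ideal` are already integers and agree with `huff`; for other values of $\alpha$ the real-valued lengths are generally not integers.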



IV. Conclusion and Future Directions 

Two lossless coding problems with multiobjective pay-offs
are investigated and the idealized real-valued codeword length
solutions are presented. Relations to problems discussed in
the literature are obtained. Based on the insight gained in this
paper, Huffman-like algorithms which solve these problems are
part of ongoing research.